Weight Sharing is Crucial to Succesful Optimization

نویسندگان

  • Shai Shalev-Shwartz
  • Ohad Shamir
  • Shaked Shammah
چکیده

Exploiting the great expressive power of Deep Neural Network architectures, relies on the ability to train them. While current theoretical work provides, mostly, results showing the hardness of this task, empirical evidence usually differs from this line, with success stories in abundance. A strong position among empirically successful architectures is captured by networks where extensive weight sharing is used, either by Convolutional or Recurrent layers. Additionally, characterizing specific aspects of different tasks, making them “harder” or “easier”, is an interesting direction explored both theoretically and empirically. We consider a family of ConvNet architectures, and prove that weight sharing can be crucial, from an optimization point of view. We explore different notions of the frequency, of the target function, proving necessity of the target function having some low frequency components. This necessity is not sufficient only with weight sharing can it be exploited, thus theoretically separating architectures using it, from others which do not. Our theoretical results are aligned with empirical experiments in an even more general setting, suggesting viability of examination of the role played by interleaving those aspects in broader families of tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Product quality improvement model considering quality investment in rework policies and supply chain profit sharing

The aim of this paper is to develop an optimization model for quality improvement by considering quality investment in rework policies and supply chain profit sharing. To improve product’s quality, the decision of process target and its tolerance is important since it directly affects the defective rate, manufacturing cost, and loss to customer due to the deviation of product from its specifica...

متن کامل

Bandwidth Optimization Algorithm Based on Weight Vector Adjustment in Generalized Processor Sharing Servers

We consider the bandwidth optimization problem in a Generalized Processor Sharing (GPS) server to minimize the total bandwidth such that QoS requirements for each class queue are satisfied. Our previous optimization algorithm [6] requires rather long optimization time to solve the problem. We propose a new optimization algorithm based on weight vector adjustment. Numerical results show that the...

متن کامل

On merging constraint and optimal control-Lyapunov functions

Merging two Control Lyapunov Functions (CLFs) means creating a single “new-born” CLF by starting from two parents functions. Specifically, given a “father” function, shaped by the state constraints, and a “mother” function, designed with some optimality criterion, the merging CLF should be similar to the father close to the constraints and similar to the mother close to the origin. To successfu...

متن کامل

Legal and Contractual Status of Income Taxation in Upstream Contracts of Oil and Gas Industry with Emphasis on Iranian Petroleum Projects

Fiscal regime of upstream oil and gas contracts is a crucial instrument that impacts sharing of revenue generated from petroleum project between host governments and oil company contractors. This regime consists of a variety of fiscal instruments and mechanisms, some of which have a legal and some others a contractual basis.  The most important legal instrument is project income taxation that i...

متن کامل

A Stochastic Optimization Approach to a Location-Allocation Problem of Organ Transplant Centers

Decision-making concerning thelocation of critical resource on the geographical network is important in many industries.In the healthcare system,these decisions include location of emergency and preventive care. The decisions of location play a crucial role due to determining the travel time between supply and de//////mand points and response time in emergencies.Organs are considered as highly ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1706.00687  شماره 

صفحات  -

تاریخ انتشار 2017